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SEQUENCE LISTING 

(1) GENERAL INFORMATION 

(i) APPLICANT: Bandman, Olga 

Hawkins, Phillip R. 
Hillman, Jennifer L. 
Lai , Preeti 
Goli, Surya K. 

(ii) TITLE OF THE INVENTION: NOVEL HUMAN SERINE 
CARBOXYPEPTIDASE 

(iii) NUMBER OF SEQUENCES: 8 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Incyte Pharmaceuticals, Inc. 

(B) STREET: 3174 Porter Drive 

(C) CITY: Palo Alto 

(D) STATE: CA 

(E) COUNTRY: USA 

(F) ZIP: 94304 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 

(Vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: To Be Assigned 

(B) FILING DATE: Filed Herewith 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(viii) ATTORNEY / AGENT INFORMATION: 

(A) NAME: Billings, Lucy J. 

(B) REGISTRATION NUMBER: 36,749 

(C) REFERENCE /DOCKET NUMBER: PF-0241 US 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 415-855-0555 

(B) TELEFAX: 415-845-4166 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 477 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: MPHGNOT03 

(B) CLONE: 443004 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

Met Val Gly Ala Met Trp Lys Val He Val Ser Leu Val Leu Leu Met 

15 10 15 

Pro Gly Pro Cys Asp Gly Leu Phe His Ser Leu Tyr Arg Ser Val Ser 

20 25 30 

Met Pro Pro Lys Gly Asp Ser Gly Gin Pro Leu Phe Leu Thr Pro Tyr 

35 40 45 

He Glu Ala Gly Lys He Gin Lys Gly Arg Glu Leu Ser Leu Val Gly 

50 55 60 

Pro Phe Pro Gly Leu Asn Met Lys Ser Tyr Ala Asp Phe Leu Thr Val 
65 " 70 75 80 

Asn Lys Thr Tyr Asn Ser Asn Leu Phe Phe Trp Phe Phe Pro Ala Gin 

85 90 95 

He Gin Pro Glu Asp Ala Pro Val Val Leu Trp Leu Gin Gly Glu Pro 

100 105 110 

Gly Gly Ser Ser Met Phe Gly Leu Phe Val Glu His Gly Pro Tyr Val 

115 120 125 

Val Thr Ser Asn Met Thr Leu Arg Asp Arg Asp Phe Pro Trp Thr Thr 

130 135 140 

Thr Leu Ser Met Leu Tyr He Asp Asn Pro Val Gly Thr Gly Phe Ser 
145 150 155 160 

Phe Thr Asp Asp Thr His Gly Tyr Ala Val Asn Glu Asp Asp Val Ala 

165 170 175 

Arg Asp Leu Tyr Ser Ala Leu He Gin Phe Phe Gin He Phe Pro Glu 

180 185 190 

Tyr Lys Asn Asn Asp Phe Tyr Val Thr Gly Glu Ser Tyr Ala Gly Lys 

195 200 205 

Tyr Val Pro Ala He Ala His Leu He His Ser Leu Asn Pro Val Arg 

210 215 220 

Glu Val Lys He Asn Leu Asn Gly He Ala He Gly Asp Gly Tyr Ser 
225 ~ 230 235 240 

Asp Pro Glu Ser He He Gly Gly Tyr Ala Glu Phe Leu Tyr Gin He 

245 250 255 

Gly Leu Leu Asp Glu Lys Gin Lys Lys Tyr Phe Gin Lys Gin Cys His 

260 265 270 

Glu Cys He Glu His He Arg Lys Gin Asn Trp Phe Glu Ala Phe Glu 

275 280 285 

He Leu Asp Lys Leu Leu Asp Gly Asp Leu Thr Ser Asp Pro Ser Tyr 

290 295 300 

Phe Gin Asn Val Thr Gly Cys Ser Asn Tyr Tyr Asn Phe Leu Arg Cys 
305 310 315 320 

Thr Glu Pro Glu Asp Gin Leu Tyr Tyr Val Lys Phe Leu Ser Leu Pro 

325 330 335 

Glu Val Arg Gin Ala He His Val Gly Asn Gin Thr Phe Asn Asp Gly 

340 345 350 

Thr He Val Glu Lys Tyr Leu Arg Glu Asp Thr Val Gin Ser Val Lys 

355 360 365 

Pro Trp Leu Thr Glu He Met Asn Asn Tyr Lys Val Leu He Tyr Asn 

370 375 380 

Gly Gin Leu Asp He He Val Ala Ala Ala Leu Thr Glu Arg Ser Leu 
385 390 395 400 

Met Gly Met Asp Trp Lys Gly Ser Gin Glu Tyr Lys Lys Ala Glu Lys 

405 410 415 

Lys Lys Val Trp Lys He Phe Lys Ser Asp Ser Gly Val Ala Gly Tyr 

420 425 430 

He Arg Gin Val Gly Asp Phe His Gin Val He He Arg Gly Gly Gly 

435 440 445 

His Thr Leu Pro Tyr Asp Gin Pro Leu Arg Ala Phe Asp Met He Asn 
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450 455 460 

Arg Phe lie Tyr Gly Lys Gly Trp Asp Pro Tyr Val Gly 
465 470 475 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 167 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: MPHGNOT03 

(B) CLONE: 443004 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 



AAGCGCTGCA 


AGGACAACCG 


GCTGGGGTCC 


TTGCGCGCCG 


GGCTCAGGGA 


GGAGCACCGA 


60 


CTGCGCCGCA 


CCCTGAGAGA 


TGGTTGGTGC 


CATGTGGAAG 


GTGATTGTTT 


CGCTGGTCCT 


120 


GTTGATGCCT 


GGCCCCTGTG 


ATGGGCTGTT 


TCACTCCCTA 


TACAGAAGTG 


TTTCCATGCC 


180 


ACCTAAGGGA 


GACTCAGGAC 


AGCCATTATT 


TCTCACCCCT 


TACATTGAAG 


CTGGGAAGAT 


240 


CCAAAAAGGA 


AGAGAATTGA 


GTTTGGTCGG 


TCCTTTCCCA 


GGACTGAACA 


TGAAGAGTTA 


300 


TGCCGACTTC 


CTCACTGTGA 


ATAAGACTTA 


CAACAGCAAC 


CTCTTCTTCT 


GGTTCTTCCC 


360 


AGCTCAGATA 


CAGCCAGAAG 


ATGCCCCAGT 


AGTTCTCTGG 


CTACAGGGTG 


AGCCGGGAGG 


420 


TTCATCCATG 


TTTGGACTCT 


TTGTGGAACA 


TGGGCCTTAT 


GTTGTCACAA 


GTAACATGAC 


480 


CTTGCGTGAC 


AGAGACTTCC 


CCTGGACCAC 


AACGCTCTCC 


ATGCTTTACA 


TTGACAATCC 


540 


AGTGGGCACA 


GGCTTCAGTT 


TTACTGATGA 


TACCCACGGA 


TATGCAGTCA 


ATGAGGACGA 


600 


TGTAGCACGG 


GATTTATACA 


GTGCACTAAT 


TCAGTTTTTC 


CAGATATTTC 


CTGAATATAA 


660 


AAATAATGAC 


TTTTATGTCA 


CTGGGGAGTC 


TTATGCAGGG 


AAATATGTGC 


CAGCCATTGC 


720 


ACACCTCATC 


CATTCCCTCA 


ACCCTGTGAG 


AGAGGTGAAG 


ATCAACCTGA 


ACGGAATTGC 


780 


TATTGGAGAT 


GGATATTCTG 


ATCCCGAATC 


AATTATAGGG 


GGCTATGCAG 


AATTCCTGTA 


840 


CCAAATTGGC 


TTGTTGGATG 


AGAAGCAAAA 


AAAGTACTTC 


CAGAAGCAGT 


GCCATGAATG 


900 


CATAGAACAC 


ATCAGGAAGC 


AGAACTGGTT 


TGAGGCCTTT 


GAAATACTGG 


ATAAACTACT 


960 


AGATGGCGAC 


TTAACAAGTG 


ATCCTTCTTA 


CTTCCAGAAT 


GTTACAGGAT 


GTAGTAATTA 


1020 


CTATAACTTT 


TTGCGGTGCA 


CGGAACCTGA 


GGATCAGCTT 


TACTATGTGA 


AATTTTTGTC 


1080 


ACTCCCAGAG 


GTGAGACAAG 


CCATCCACGT 


GGGGAATCAG 


ACTTTTAATG 


ATGGAACTAT 


1140 


AGTTGAAAAG 


TACTTGCGAG 


AAGATACAGT 


ACAGTCAGTT 


AAGC CATGGT 


TAACTGAAAT 


1200 


CATGAATAAT 


TATAAGGTTC 


TGATCTACAA 


TGGCCAACTG 


GACATCATCG 


TGGCAGCTGC 


1260 


CCTGACAGAG 


CGCTCCTTGA 


TGGGCATGGA 


CTGGAAAGGA 


TCCCAGGAAT 


ACAAGAAGGC 


1320 


AGAAAAAAAA 


AAAGT TTGGA 


AG ATC TTTAA 


ATCTGACAGT 


GGAGTGGCTG 


GTTACATCCG 


1380 


GCAAGTGGGT 


GACTTCCATC 


AGGTAATTAT 


TCGAGGTGGA 


GGACATACTT 


TACCCTATGA 


1440 


CCAGCCTCTG 


AGAGCTTTTG 


ACATGATTAA 


TCGATTCATT 


TATGGAAAAG 


GATGGGATCC 


1500 


TTATGTTGGA 


TAAACTACCT 


TCCCAAAAGA 


GAACATCAGA 


GGTTTTCATT 


GCTGAAAAGA 


1560 


AAATC GTAAA 


AACAGAAAAT 


GTCATAGGAA 


TAAAAAAATT 


ATCTTTTCAT 


ATCTGCAAGA 


1620 


TCTTTTTCAT 


CAATAAAAAT 


TATCCTTGAA 


ACAAAAAAAA 


AAAGAAAAAG 




1670 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 47 6 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: MMLR3DT01 

(B) CLONE: 566993 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO r 3 : 

Met Val Gly Ala Met Trp Lys Val lie Val Ser Leu Val Leu Leu Met 

1 5 10 15 

Pro Gly Pro Cys Asp Gly Leu Phe His Ser Leu Tyr Arg Ser Val Ser 

20 25 30 

Met Pro Pro Lys Gly Asp Ser Gly Gin Pro Leu Phe Leu Thr Pro Tyr 

35 40 45 

lie Glu Ala Gly Lys lie Gin Lys Gly Arg Glu Leu Ser Leu Val Gly 

50 55 60 

Pro Phe Pro Gly Leu Asn Met Lys Ser Tyr Ala Gly Phe Leu Thr Val 
65 70 75 80 

Asn Lys Thr Tyr Asn Ser Asn Leu Phe Phe Trp Phe Phe Pro Ala Gin 

85 90 95 

lie Gin Pro Glu Asp Ala Pro Val Val Leu Trp Leu Gin Gly Gly Pro 

100 105 110 

Gly Gly Ser Ser Met Xaa Gly Leu Phe Val Glu His Gly Pro Tyr Val 

115 120 125 

Val Thr Ser Asn Met Thr Leu Arg Asp Arg Asp Phe Pro Trp Thr Thr 

130 135 140 

Thr Xaa Ser Met Leu Tyr lie Asp Asn Pro Val Gly Thr Gly Phe Ser 
145 150 155 160 

Phe Thr Asp Asp Thr His Gly Tyr Ala Val Asn Glu Asp Asp Val Ala 

165 170 175 

Arg Asp Leu Tyr Ser Ala Leu lie Gin Phe Phe Gin lie Phe Pro Glu 

180 185 190 

Tyr Lys Asn Asn Asp Phe Tyr Val Thr Gly Glu Ser Tyr Ala Gly Lys 

195 200 205 

Tyr Val Pro Ala lie Ala His Leu He His Ser Leu Asn Pro Val Arg 

210 215 220 

Glu Val Lys He Asn Leu Asn Gly He Ala He Gly Asp Gly Tyr Ser 
225 230 235 ~ ~ 240 

Asp Pro Glu Ser He He Gly Gly Tyr Ala Glu Phe Leu Tyr Gin He 

245 250 255 

Gly Leu Leu Asp Glu Lys Gin Lys Lys Tyr Phe Gin Lys Gin Cys His 

260 265 270 

Glu Cys He Glu His He Arg Lys Gin Asn Trp Phe Glu Ala Phe Glu 

275 280 285 

He Leu Asp Lys Leu Leu Asp Gly Asp Leu Thr Ser Asp Pro Ser Tyr 

290 295 300 

Phe Gin Asn Val Thr Gly Cys Ser Asn Tyr Tyr Asn Phe Leu Arg Cys 
305 310 315 320 

Thr Glu Pro Glu Asp Gin Leu Tyr Tyr Val Lys Phe Leu Ser Leu Pro 

325 330 335 

Glu Val Arg Gin Ala He His Val Gly Asn Gin Thr Phe Asn Asp Gly 

340 345 350 

Thr He Val Glu Lys Tyr Leu Arg Glu Asp Thr Val Gin Ser Val Lys 

355 360 365 

Pro Trp Leu Thr Glu He Met Asn Asn Tyr Lys Val Leu He Tyr Asn 

370 375 " 380 

Gly Gin Leu Asp He He Val Ala Ala Ala Leu Thr Glu Arg Ser Leu 
385 390 395 400 

Met Gly Met Asp Trp Lys Gly Ser Gin Glu Tyr Lys Lys Ala Glu Lys 

405 410 415 

Lys Val Trp Lys He Phe Lys Ser Asp Ser Glu Val Ala Gly Tyr He 

420 425 430 

Arg Gin Val Gly Asp Phe His Gin Val He He Arg Gly Gly Gly His 

435 440 445 

He Leu Pro Tyr Asp Gin Pro Leu Arg Ala Phe Asp Met He Asn Arg 
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450 455 460 

Phe lie Tyr Gly Lys Gly Trp Asp Pro Tyr Val Gly 
465 470 475 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1551 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: MMLR3DT01 

(B) CLONE: 566993 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

GAAAGCTGGT ACGCCTGCNG GTNCCGGTCC GGAATTCNCG GGTNGAC CC A CGCGTCCGAN 60 

CGACTGCGCC GCACCCTGAG AGATGGTTGG TGCCATGTGG AAGGTGATTG TTTCGCTGGT 120 

CCTGTTGATG CCTGGCCCCT GTGATGGGCT GTTTCACTCC CTATACAGAA GTGTTTCCAT 180 

GCCACCTAAG GGAGACTCAG GACAGCCATT ATTTCTCACC CCTTACATTG AAGCTGGGAA 240 

GATCCAAAAA GGAAGAGAAT TGAGTTTGGT CGGCCCTTTC CCAGGACTGA ACATGAAGAG 300 

TTATGCCGGC TTCCTCACCG TGAATAAGAC TTACAACAGC AACCTCTTCT TCTGGTTCTT 3 60 

CCCAGCTCAG ATACAGCCAG AAGATGCCCC AGTAGTTCTC TGGCTACAGG GTGGGCCGGG 420 

AGGTTCATCC ATGTTWGGAC TCTTTGTGGA ACATGGGCCT TATGTTGTCA CAAGTAACAT 480 

GACCTTGCGT GACAGAGACT TCCCCTGGAC CACAACGSTC TCCATGCTTT ACATTGACAA 540 

TCCAGTGGGC ACAGGCTTCA GTTTTACTGA TGATACCCAC GGATATGCAG TCAATGAGGA 600 

CGATGTAGCA CGGGATTTAT ACAGTGCACT AATTCAGTTT TTCCAGATAT TTC CTGAATA 660 

TAAAAATAAT GACTTTTATG TCACTGGGGA GTCTTATGCA GGGAAATATG TGC C AGC CAT 720 

TGCACACCTC ATCCATTCCC TCAACCCTGT GAGAGAGGTG AAGATCAACC TGAACGGAAT 780 

TGCTATTGGA GATGGATATT CTGATCCCGA ATCAATTATA GGGGGCTATG CAGAATTCCT 840 

GTACCAAATT GGCTTGTTGG ATGAGAAGCA AAAAAAGTAC TTC C AGAAGC AGTGCCATGA 900 

ATGCATAGAA CACATCAGGA AGCAGAACTG GTTTGAGGCC TTTGAAATAC TGGATAAACT 960 

ACTAGATGGC GACTTAACAA GTGATCCTTC TTACTTCCAG AATGTTACAG GATGTAGTAA 102 0 

TTACTATAAC TTTTTGCGGT GCACGGAACC TGAGGATCAG CTTTACTATG TGAAATTTTT 1080 

GTCACTCCCA GAGGTGAGAC AAGCCATCCA CGTGGGGAAT CAGACTTTTA ATGATGGAAC 1140 

TATAGTTGAA AAGTACTTGC GAGAAGATAC AGTACAGTCA GTTAAGCCAT GGTTAACTGA 1200 

AATCATGAAT AATTATAAGG TTCTGATCTA CAATGGCCAA CTGGACATCA TCGTGGCAGC 1260 

TGCCCTGACA GAGCGCTCCT TGATGGGCAT GGACTGGAAA GGTTCCCAGG AATACAAGAA 1320 

GGCAGAAAAA AAAGTTTGGA AGATCTTTAA ATCTGACAGT GAAGTGGCTG GTTACATCCG 1380 

GCAAGTGGGT GACTTCCATC AGGTAATTAT TCGAGGTGGA GGACATATTT TACCCTATGA 1440 

CCAGCCTCTG AGAGCTTTTG ACATGATTAA TCGATTCATT TATGGAAAAG GATGGGATCC 1500 

TTATGTTGGA TAAACTACCT TCCCAAAAGA GAACATCAGA GGTTTTCATN T 1551 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 351 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: COLNCRT01 

(B) CLONE: 770469 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 
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Met Val Gly Ala Met Trp Lys Val lie Val Ser Leu Val Leu Leu Met 

1 5 10 15 

Pro Gly Pro Cys Gly Gly Leu Phe His Ser Leu Tyr Arg Ser Val Ser 

20 25 30 

Met Pro Pro Lys Gly Asp Ser Gly Gin Pro Leu Phe Leu Thr Pro Tyr 

35 40 45 

He Glu Ala Gly Lys He Tyr Thr Gly Thr Asn Ser Val Phe Gin He 

50 55 60 

Phe Pro Glu Tyr Lys Asn Asn Asp Phe Tyr Val Thr Gly Glu Ser Tyr 
65 70 75 80 

Ala Gly Lys Tyr Val Pro Ala He Ala His Leu He His Ser Leu Asn 

85 90 95 

Pro Val Arg Glu Val Lys He Asn Leu Asn Gly He Ala He Gly Asp 

100 105 110 

Gly Tyr Ser Asp Pro Glu Ser He He Gly Gly Tyr Ala Glu Phe Leu 
115 120 125 

jp«R rpy-r Qly ^ ^ s ^ s ^ ^ Q ln Lys 

|| 130 135 140 

|:| Gln Cys His Glu Cys Ile Glu His Ile Arg Lys Gln Asn Trp Phe Glu 

l| 145 150 155 ~ 160 

aJS Ala Phe Glu Ile Leu Asp Lys Leu Leu Asp Gly Asp Leu Thr Ser Asp 

fit 165 170 " ~ 175 

| s = Pro Ser Tyr Phe Gln Asn Val Thr Gly Cys Ser Asn Tyr Tyr Asn Phe 

It 180 185 190 

g Leu Arg Cys Thr Glu Pro Glu Asp Gln Leu Tyr Tyr Val Lys Phe Leu 

C3 195 200 205 

|| Ser Leu Pro Glu Val Arg Gln Ala Ile His Val Gly Asn Gln Thr Phe 

f| 2 10 215 220 

1 1 Asn Asp Gly Thr Ile Val Glu Lys Tyr Leu Arg Glu Asp Thr Val Gln 
JSj 225 2 30 235 ~ 240 

Im Ser Val L Y S Trp Leu Thr Glu Ile Met Asn Asn Tyr Lys Val Leu 

245 250 255 

He Tyr Asn Gly Gln Leu Asp Ile Ile Val Ala Ala Ala Leu Thr Glu 

260 265 270 

Arg Ser Leu Met Gly Met Asp Trp Lys Gly Ser Gln Glu Tyr Lys Lys 

2 7 5 280 285 

Ala Glu Lys Lys Val Trp Lys Ile Phe Lys Ser Asp Ser Glu Val Ala 

290 295 300 

Gly Tyr Ile Arg Gln Val Gly Asp Phe His Gln Val He lie Arg Gly 
305 310 315 320 

Gly Gly His Ile Leu Pro Tyr Asp Gln Pro Leu Arg Ala Phe Asp Met 

325 330 335 

Ile Asn Arg Phe Ile Tyr Gly Lys Gly Trp Asp Pro Tyr Val Gly 
340 345 350 

(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 63 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY : COLNCRT01 

(B) CLONE: 770469 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
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AAGGCAAACC GGCTGGGGTC CTTGCGCGCC GCGGCTCAGG GAGGAGCACC GACTGCCCCG 60 

CACCCTGAGA GATG GTTGGT GCCATGTGGA AGGTGATTGT TTCGCTGGTC CTGTTGATGC 12 0 

CTGGCCCCTG TGGTGGGCTG TTTCACTCCC TATACAGAAG TGTTTCCATG CCACCTAAGG 18 0 

GAGACTCAGG ACAGCCATTA TTTCTCACCC CTTACATTGA AGCTGGGAAG ATTTATACAG 240 

GCACTAATTC AGTTTTCCAG ATATTTCCTG AATATAAAAA TAATGACTTT TATGTCACTG 300 

GGGAGTCTTA TGCAGGGAAA TATGTGC C AG CCATTGCACA CCTCATCCAT TCCCTCAACC 360 

CTGTGAGAGA GGTGAAGATC AAC CTGAACG GAATTGCTAT TGGAGATGGA TATTCTGATC 420 

CCGAATCAAT TATAGGGGGC TATGCAGAAT TCCTGTACCA AATTGGCTTG TTGGATGAGA 480 

AGCAAAAAAA GTACTTCCAG AAGCAGTGCC ATGAATGCAT AGAACACATC AGGAAGCAGA 540 

ACTGGTTTGA GGCCTTTGAA ATACTGGATA AACTACTAGA TGGCGACTTA ACAAGTGATC 600 

CTTCTTACTT CCAGAATGTT ACAGGATGTA GTAATTACTA TAACTTTTTG CGGTGCACGG 660 

AACCTGAGGA TCAGCTTTAC TATGTGAAAT TTTTGTCACT CCCAGAGGTG AGACAAGCCA 720 

TCCACGTGGG GAATCAGACT TTTAATGATG GAACTATAGT TGAAAAGTAC TTGCGAGAAG 780 

ATACAGTACA GTCAGTTAAG CCATGGTTAA CTGAAATCAT GAATAATTAT AAGGTTCTGA 840 

TCTACAATGG CCAACTGGAC ATCATCGTGG CAGCTGCCCT GACAGAGCGC TCCTTGATGG 900 

GCATGGACTG GAAAGGATCC CAGGAATACA AGAAGGCAGA AAAAAAAGTT TGGAAGATCT 9 60 

TTAAATCTGA CAGTGAAGTG GCTGGTTACA TCCGGCAAGT GGGTGACTTC CATCAGGTAA 1020 

Qi TTATTCGAGG TGGAGGACAT ATTTTAC CCT ATGACCAGCC TCTGAGAGCT TTTGACATGA 1080 

13 TTAATCGATT CATTTATGGA AAAGGATGGG ATCCTTATGT TGGATAAACT ACCTTCCCAA 1140 

Ml! AAGAGAACAT CAGAGGTTTT CATTGCTGAA AAGAAAATCG TAAAAACAGA AAATGTCATA 1200 

M GGAATAAAAA AATTATCTTT TCATATCTGC AAGATTTTTT TCATCAATAA AAATTATCCT 1260 
13 TGA 

C|| (2) INFORMATION FOR SEQ ID NO: 7: 



(i) SEQUENCE CHARACTERISTICS: 
if! (A) LENGTH: 471 amino acids 

ff| (B) TYPE: amino acid 

gpi (C) STRANDEDNESS : single 

f% (D) TOPOLOGY: linear 



ra 



(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: GenBank 

(B) CLONE: 1718107 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

Met Val Lys Phe His Leu Leu Val Leu He Ala Phe Thr Cys Tyr Thr 

1 5 10 " 15 

Cys Ser Asp Ala Thr Leu Trp Asn Pro Tyr Lys Lys Leu Met Arg Gly 

20 25 30 

Ser Ala Ser Pro Pro Arg Pro Gly Glu Ser Gly Glu Pro Leu Phe Leu 

35 40 45 

Thr Pro Leu Leu Gin Asp Gly Lys He Glu Glu Ala Arg Asn Lys Ala 

50 55 60 

Arg Val Asn His Pro Met Leu Ser Ser Val Glu Ser Tyr Ser Gly Phe 
- 65 70 75 " 80 

Met Thr Val Asp Ala Lys His Asn Ser Asn Leu Phe Phe Trp Tyr Val 

85 90 A 95 

Pro Ala Lys Asn Asn Arg Glu Gin Ala Pro He Leu Val Trp Leu Gin 

100 ' 105 HO 

Gly Gly Pro Gly Ala Ser Ser Leu Phe Gly Met Phe Glu Glu Asn Gly 

115 120 125 

Pro Phe His lie His Arg Asn Lys Ser Val Lys Gin Arg Glu Tyr Ser 

130 135 140 

Trp His Gin Asn His His Met He Tyr He Asp Asn Pro Val Gly Thr 
145 150 155 160 

Gly Phe Ser Phe Thr Asp Ser Asp Glu Gly Tyr Ser Thr Asn Glu Glu 
165 170 175 
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ill o 


VaJL 


biy 


blU 
IOU 


Asn 


Leu 


IXlfcr U 


Lys 


jrne 

lOJ 


Tl 0 

lie 


HI n 

bin 


r*i n 
bin 


Jrlle 


fne 
1 qn 

1 _7 U 


val lieu 




DViq 


PrO 


Asn 
195 


Leu 


Leu 


Lys 


111 fa 


izJL kJ 

200 




Tyr 


lie 


Qov* 


biy 
205 


blU 


pei iyr 




biy 


r*i \r 
biy 

210 


Lys 


DV>o 

jrne 


vaj. 


Pro 


Al ^ 
Aid 

215 


irne 


briy 


Tyr 


Aid 


Tl i=s 

lie 
220 


rll S 


Asn 


Ser Gin 




Ser 


bin 


Pro 


Lys 


Tl a, 

Lie 


Asn 


Leu 


bin 


biy 


Leu 


Ala 


Tl /-» 

lie 


Gly 


Asp 


Gly Tyr 




225 










230 










235 








240 




Thr 


Asp 


Pro 


Leu 


Asn 
245 


C 1 v-t 

bin 


Leu 


Asn 


Tyr 


biy 

250 


blU 


Tyr 


Leu 


Tyr 


Glu Leu 
255 




biy 


Leu 


lie 


Asp 
zou 


Leu 


Asn 


biy 


Arg 


Lys 
0 & r 


Lys 


Fne 


Asp 


Glu 


Asp 

A / U 


f-p"l-\ 7\ "1 

x nr ai a 






rlld 


ne 

Z / J 


Aid 


Cys 


ai a 
Ala 


blU 


Arg 


Lys 


Asp 


i v ieu 


Asn 


Ser 

Zo D 


Aia 


Asn Arg 




L©U 


lie 
z _? u 


bin 




Leu 


irne 


Asp 

Zi? D 


biy 


Leu 


Asp 


biy 


bin 

jUU 


blU 


Ser 


iyr irne 




Lys 


Lys 


Vdl 


Thr 


biy 


FJie 


Ser 


Ser 


Tyr 


Tyr 


Asn 


Pne 


lie 


Lys 


Gly Asp 
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Lys 


bill 


Asp 


Ser 


Vdl 


Leu 




r*i n 

bill 


irne 


Leu 


Ser 


Asn Pro 
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r*i n 


T7"=a 1 
Vdl 


Arg 


Lys 


biy 


1 Jie 


His 


val 


biy 


Glu 


Leu 


Pro 


Phe 


His 


Asp Ser 


































i s 


Asp 


vziy 


rjL-L S 


Asn 


Lys 


Vdl 


ax a 


blU 


rien. 


Leu 


Ser 


blU 


Asp 


Thr 


Leu Asp 








1 r 

J .J J 










JOu 










0 cr 
jDj 






£ 


X 111 


val 


Al = 
Hid 


Pro 


Trp 


\ 7"=, T 

Vdl 


Ser 


Lys 


Leu 


Leu 


Ser 


JtllS 


Tyr 


Arg 


vdi ijeu 






370 










375 










380 










TYVi 

PxlS 


Tyr 


Asn 


Gly 


bin 


Leu 


Asp 


He 


He 


Cys 


Ala 


Tyr 


Pro 


Met 


Tnr Val 


s 


385 










390 










395 








400 


in. 


Asp 


Fine 


Leu 


Met 


Lys 


Met 


Pro 


Phe 
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(2) INFORMATION FOR SEQ ID NO: 8: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 480 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 



(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: GenBank 

(B) CLONE: 190283 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met He Arg Ala Ala Pro Pro Pro Leu Phe Leu Leu Leu Leu Leu Leu 

15 10 15 

Leu Leu Leu Val Ser Trp Ala Ser Arg Gly Glu Ala Ala Pro Asp Gin 

20 25 30 

Asp Glu He Gin Arg Leu Pro Gly Leu Ala Lys Gin Pro Ser Phe Arg 
35 40 45 
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